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In recent years there has been considerable interest in the structure and dynamics of complex 
networks. One of the most studied networks is the linear Barabasi-Albert model. Here we investigate 
the nonlinear Barabasi-Albert growing network. In this model, a new node connects to a vertex 
of degree k with a probability proportional to k a (a real). Each vertex adds m new edges to the 
network. We derive an analytic expression for the degree distribution P(k) which is valid for all 
values of m and a < 1. In the limit a —* — oo the network is homogeneous. If a > 1 there is a gel 
phase with m super-connected nodes. It is proposed a formula for the clustering coefficient which is 
in good agreement with numerical simulations. The assortativity coefficient r is determined and it 
is shown that the nonlinear Barabasi-Albert network is assortative (disassortative) if a < 1 (a > 1) 
and no assortative only when a = 1. In the limit a — > — oo the assortativity coefficient can be 
exactly calculated. We find r = ^ when m — 2. Finally, the minimum average shortest path length 
l m in is numerically evaluated. Increasing the network size, l m i n diverges for a < 1 and it is equal 
to 1 when a > 1. 



PACS numbers: 89.75.Hc, 02.50.- 
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I. INTRODUCTION 

In the late 1990s, the researches on complex networks 
knew an explosive development [ij, |2j, |3j- The impulse 
came from the convergence of interests between differ- 
ent sectors like physics, biology, technology and sociol- 
ogy turning the study of complex networks into a new 
interdisciplinary field. 

A network is a set of vertices or nodes provided with 
some rule to connect them by edges. With such a simple 
definition, it is not a surprise that it has found so much 
utility in so many areas. Following the classification in- 
troduced by Newman pj, one can divide real networks 
into four types: social, biological, technological and in- 
formational. 

Social networks are composed by interacting people 
with some pattern of contacts like friendship, business or 
sexual partners. One of the most popular works in social 
webs was carried out by Milgram [4j who first arrived 
to the concept of " six degrees of separation" and small- 
world. Social networks also includeproblems like the col- 
laboration network of film actors [a, or co-authorship 
among academics 0. 

A number of problems can be mapped into biological 
networks. Some very good examples are: the 282-neuron 
neural network of the nematode C. Elegans @, blood 
vessels and vascular networks 0, food webs in an 
ecosystem with species living in a prey-predator scheme 
[Til Il2^ , network of metabolic pathways and the ge- 
netic regulatory network for the expression of a gene |l4j . 

Technological networks are those built by man in his 
arduous struggle for progress and welfare. Perhaps, the 
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best known example is the electric power grid (high- 
voltage three-phase transmission lines network) |5|, 1 1 5| . 
This category also includes the networks of airline routes 
(l5j . railways and internet structures [r^| . 

The fourth type is the information network. One of 
the oldest example of this kind is the network of citations 
between academic papers [Te| . This network is certainly 
influenced by social relationships and it may be somehow 
contaminated since putting a citation in a paper does not 
mean that the author had actually read it |19|. Another 
important example is the World Wide Web, which is the 
network of informations between Web pages [20||. It is a 
directed network with different power laws for the in and 
out degree distributions. 

Motivated by such a number of applications, a myr- 
iad of theoretical models were proposed aiming to re- 
produce or to describe real-world networks. In many of 
these models, the degree distribution is power law and 
the corresponding network is said scale- free 0, EI- A 
good number of real- world networks are of this kind. 

One widely studied scale-free network is the linear 
Barabasi-Albert model (BA) 0,112 ■ In its general form, 
that we call the nonlinear Barabasi-Albert model (NBA) , 
the network is constructed as follows. A new vertex con- 
nects to another (already existing) vertex i with proba- 
bility Il(fej) = fef/Xw^", where k t is the degree, i. e., 
the number of edges connected to the vertex i and a is a 
real number. Each new vertex adds m new edges to the 
network. In the linear case (a = 1), the model has many 
desired characteristics like to be a scale-free network or 
to have the small- world effect, but it also has some other 
features which are not so coveted: it is no assortative 
and the clustering goes to zero with the increasing size of 
the network. A network is said assortative if it connects 
preferentially nodes with almost the same degree. It is 
something like a social stratification by the incomes. If 
we think of social networks, assortative mixing and clus- 
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tering are both yearned qualities |23| . 

In this paper, we investigate the NBA (the model, not 
the basketball association) by determining some impor- 
tant quantities like the degree distribution, the clustering 
coefficient C, the assortativity coefficient r and the av- 
erage shortest path length I. This is done in the whole 
space of the parameters m and a. We obtain some inter- 
esting analytical as well as numerical results. Writing the 
master equation for the number of vertices with degree fc, 
we derive an expression for the degree distribution P(k). 
If a > 1, the gel phase has to super- connected sites. 
For a — > — oo the NBA becomes a homogeneous lattice 
with all sites having connectivity 2m. Through a sim- 
ple change of stochastic variables, we rederive the known 
stretched exponential form of P(k) in the mean field ap- 
proximation. It is proposed a formula for the clustering 
coefficient which was verified by numerical simulations. 
The behavior of the assortativity coefficient is numeri- 
cally evaluated. The NBA is assortative (disassortative) 
if a < 1 (a > 1) and no assortative only when a = 1. 
In the limit a — > — oo, the assortativity coefficient can 
be calculated exactly. We find r = j§ when m = 2. 
By numerical simulations, the minimum average short- 
est path length l m in is determined. As the size of the 
network increases, l m in diverges if a < 1 and it is equal 
to 1 otherwise. 
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We can now establish the dependence of the amplitude 
fi on a. Using the relation fi = J2k>m k a P{k) together 
with the equation above we obtain the intrinsic relation 



fi = m a [l — m + 



oo 



n 



(5) 



If a < 0, the derived expressions are still valid but 
the NBA model looses its usual preferential attachment 
or capitalistic interpretation. With a negative, the new 
vertex connects to those poorest vertices in a say social- 
istic way. If a = —\\a\\ with ||a|| 3> 1 and assuming the 
connectivity k to be a continuous variable, the degree 
distribution exhibits a maximum at k max — [ m ^ a H ]TF¥ . 
The degree distribution expanded around this maximum 
has the gaussian form 



II. DEGREE DISTRIBUTION USING THE 
MASTER EQUATION 

The probability P{k) that a randomly chosen vertex 
has degree k was obtained in references pil |25| for the 
case m = 1 . Here we derive this probability for any value 
of to. For a network with N vertices, the master equation 
reads 



dN{k) 
dt 



M a 



[(k - l) a N(k - 1) - k a N(k)} + 4,™ , (1) 



where N(k) is the number of vertices with degree k at the 
time t and M a = J2k= m k a N{k) gives the proper normal- 
ization. The first (second) term accounts for the process 
in which a vertex with k — 1 (k) links is connected to the 
new vertex. For the m possibilities, this happens with 
probability m(k - l) a /M a (mk a /M a ). The last term 
accounts for the continuous introduction of new vertices. 

If a < 1, we expect M a to scale linearly in the long- 
time limit, i.e., M a = fit (t ~ N = J2k N ( k ))- Writing 
N(k) — tP(k), where P(k) is the stationary degree dis- 
tribution, and substituting it into the equation above we 
get for k — to 



P(fc)HM|exp(- 



~{k k max ) ) 
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In the limit ||a|| — > oo it approaches the Dirac Delta 
function 8(k ~ k rnax ). The condition ^^kP^k) = 2to 
fixes k max = 2m. The network is now homogeneous with 
all vertices having connectivity 2to. 

If a > 1 and to = 1, the asymptotic behavior of M a 
is t a [24| and there arise one super-connected node to 
which almost every other vertex is connected. When 
to > 1, we found m super-connected nodes. This re- 
sult was confirmed exhaustively by our numerical simu- 
lations. For large enough networks, the degree histogram 
always shows (with no exceptions) to super-connected 
nodes. The probability B that the m initial vertices are 
connected to all other remaining sites of the network can 
be calculated. In the discrete time version process, let 
s = t - 1 (t = N - to). When the (to + 2)-th vertex 
is being aggregated, the probability that it will be con- 
In 



nected to all to initial sites is 9(s 
general, we have 



i) = n 



3=1 j+m" 



1 



(7) 



and 



P(to) 



H - 



,1+ct 



(2) 



B 



(8) 



3 



Clearly, B — when a < 2 and B > otherwise, 
reproducing the results obtained for m = 1 [24j|. Thus 
the threshold a = 2 does not depend on to. 



III. DEGREE DISTRIBUTION IN THE MEAN 
FIELD APPROXIMATION 

In the mean field context, the de gree distribution was 
obtained for the linear BA network [23] and for the NBA 
model with link fluctuations j2||. Here we rederive the 
results using a very simple and straightforward change of 
stochastic variable. 

Assuming the connectivity ki to be a continuous vari- 
able defined on a site i of a network with N vertices, one 
can write 



dt 



V h<* 



(9) 



As in the previous section, if a < 1 we can assume 
E^Li — /LtiV ~ /it. Substituting this into the expres- 
sion above and integrating it, we get 



. m a (l — a) . , t ... 
= m l + i '- ln(-) j 



(10) 



where is the time when the site i was incorporated 
to the network and the initial condition ki{ti) = m was 
used. Now, if one vertex is to be picked randomly then 
the probability that the chosen site is i is P{ti) = ^, so 
the equation above can be seen as a change of stochastic 
variables fj — > ki and the respective distributions are 
related by P(k) = J 5(k - h(U))P(t t )dU. The result is 



P(k) = exp{ 
mk a 



M [( j 1 " 
m"(l — a) m 



1]}, (11) 



which is the asymptotic behavior of the exact P(k) given 
by Eq.(4) (see also [2(|). We will later use this mean 
field result to derive the clustering coefficient in the next 
section. 

If the same procedure is applied to the BA with the 
modified preferential attachment 



dt 



+ A) 



Ef=i(%+A) 
then the degree distribution is given by 
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which, in the asymptotic limit i > 1, is a power law 
k~ 3 ~~ with the exponent tuned by the parameter A. 



IV. CLUSTERING COEFFICIENT 

The clustering coefficient is a transitivity property of 
the network. If a vertex 1 is connected with vertex 2 
and the vertex 2 with the vertex 3 then there is a high 
probability that the vertex 1 is also connected to 3. In 
social networks, this can be easily interpreted as the fact 
that the friend of your friend is likely to be your friend. 
The clustering coefficient of order a: of a site i, Ci(x), 
is defined as the probability that there is a distance of 
length x (measured without passing through i) between 
two nearest neighbors of the site i [27j ■ It is given by 



Ci(x) 



2Vi 



(14) 



where yi is the number of such x distances and ki is the 
degree of the vertex i. For all vertices of the network, 
the average clustering coefficient is C(x) — £i=L^iM. 
In this paper, we will only treat the order x = 1, so we 
drop the index x and use simply C as being the average 
clustering coefficient. Of course, when to = 1, C is always 
equal to zero (independently of the value of a) since the 
graph is a tree like. 

We can derive an analytic expression for C in the mean 
field approximation. Let P(j i) be the probability 
that at the time t — j the vertex j is connected to the 
vertex i. This means that P(j — > i) = m „ ., , 

where ki(j) is the degree of the vertex i at the time j. 
For a < 1, the denominator of this probability is equal 
to \ij and using Eq.(10) we get 



m a (1— a) 



ln(f)]x^ 



j>i. (15) 



When a is equal to one, the expression above reduces 
to f (ij)- 1 / 2 which was first obtained by Klemm and 
Egufluz |23|. 

To find the clustering coefficient C, care must be taken 
with the ordering j > i. They cannot be interchanged. 
If i and j are two nearest neighbors vertices of the node 
I and we assume i < j, then the contributions come from 
3 regimes: l<i<j,i<l<j and i < j < I. In this 
way, the average clustering coefficient can be written 



m{N) 



j=3 i=2 1=1 

f^P(HP(H)^ 

j=3 1=1 i=l v ' 

^ P(l -> i)P(l -> j)P(j -> j) 1 

ZvLL ^ ] . ( 16 ) 

1=3 ]=2 i=l ly ' 

where ni (N) = k ^ N ^ k ^ N '>~ 1 ^ [ s the total number of pairs 
of neighbors that the node / has at the time (or size) N. 



4 



For all values of a < 1 , the clustering coefficient goes to 
zero with N. There are two particular cases in which the 
asymptotic behavior of C is known exactly: when a = 
(random graph) Coc^ and a = 1, C(N) oc A^ _1 (lnA^) 2 
|28j . The latter form is more consistent with the numeri- 
cal simulations results than an earlier theoretical predic- 
tion (C(N) oc 7V-°- 75 j2^). 

We simulate the nonlinear NBA model in networks 
with sizes varying from N = 100 up to N = 102,400 
and m — 2. For each size, the average clustering co- 
efficient was determined and then average it over 20 to 
24,000 independent runs. In the Fig.l, we plot the de- 
pendence of C with a for two network sizes N = 102, 400 
and N = 25, 600. 
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FIG. 2: Comparison between numerical simulation and the- 
ory. Black squares are the simulated points and the open 
circles correspond to the proposed analytical formula 



probability distribution that a randomly chosen edge has 
degrees j and k at either end. It obeys the sum rules 



jk 



= 1 > ^2 e jk = qk- 



(17) 



FIG. 1: The average clustering coefficient C versus the expo- 
nent q for two different network sizes N. 



Originally, j and k were defined as the remaining de- 
grees - the number of edges leaving the vertex other than 
the one we arrived alo ng [321. We can define the assor- 
tativity coefficient r as |30t l31| 



In the Figure 2, we compare the results of the analyti- 
cal expression (eq.(16)) with those of the simulated NBA 
model. 

For a > 1, the clustering coefficient rapidly approaches 
the value 1 as we increase the size N. This result comes 
from the fact that the m super-connected nodes are also 
inter-connected. Thus, when a new vertex is added to 
the network, the strong preferential dynamics (alpha l 1) 
forces it to connect with the m super-connected nodes, 
leading to a clustering coefficient value close to one. 



V. MIXING PATTERNS 

One important aspect of the networks is how the ver- 
tices are linked. If there is a tendency to connect vertices 
with almost the same degree, we call it an assortative 
mixing. When the links favor nodes of very different 
degrees the system is said disassortative. The neutral 
situation is named no assortative [30l l3l| . To measure 
the assortative mixing level, we slig htly change the defi- 
nitions given by Callaway et al. |32j. Let ejk be the joint 



r = — ^2.i k ( e jk - QjQk), (18) 

where cr 2 = J2k^ 2 1 k ~ Efc^9fc] 2 i s trie variance of 
the distribution q^. The quantity r lies in the interval 
— 1 < r < 1. It has the value +1 (—1) if the system is 
perfectly assortative (disassortative) and it is zero in the 
no assortative case. 

In our simulations, we determine r and average it in a 
number of 20 to 24, 000 runs depending on the network 
size N. In the Figure 3 we plot the assortative coefficient 
r against the exponent a. In the limit of an infinite 
lattice, r = 0ifa = l,a result already known [3fl |. 
Our numerical data show that this is the unique case. 
The NBA model is assortative (r > 0) for a < 1 and 
disassortative (r < 0) when a > 1. 

In the limit a — > — oo the assortativity coefficient can 
be analytically calculated. For example, if rn = 2 then 
the number of vertices N(k) with degree k is N(2) = 1, 
TV (3) = 6, N(4) = N - 7 and zero for k > 5. The mean 
degree < k > is equal to 4— -j^. The joint probability 
distribution = 2,3,4) can be easily evaluated 
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FIG. 3: The dependence of the assortativity coefficient r with 
the exponent a. 



FIG. 4: The minimum average shortest path length l T} 
sus the exponent a for two different network sizes. 




Using the definition (Eq. 18) and the results above one 
can get straightforwardly the exact value r = j^. 



VI. AVERAGE SHORTEST PATH LENGTH 

With all edges of a network having unit length, the dis- 
tance between two vertices of that network is the size of 
the shortest path length between them (geodesic). When 
we take into account all possible pairs of vertices, the 
mean value corresponds to what is known as the average 
shortest path length I (sometimes also called the diam- 
eter). It is a fundamental concept in networks studies 
and very important in the field of communications and 
computer's web where routing and searching are common 
tasks. 

To estimate I which is defined as the 

average distance of all vertices on the graph measured 
from the vertex with the highest degree in the network 
(if there is more than one, we choose one by chance). The 
average shortest path length I is restricted to the interval 

M 



(19) 



We determine l m i n for several values of a and m by 
simulating the nonlinear Barabasi-Albert in networks 
with sizes varying from N — 100 up to N — 102, 400 
and averaging it over 20 to 24, 000 independent realiza- 
tions. Figure 4 shows how l m in depends on a for two 



different networks sizes. In the limit N — » oo, l min is 
infinite for a < 1 and equal to 1 when a > 1. 

Let us now discuss the dependence of l min with N. We 
conjecture that both I and l m in scale with N in the same 
way. It is well known that random networks, such as 
Erdos-Renyi networks 0j35|, or partially random like 
the small- world networks [a], have an average shortest 
path length scaling as I ~ ln(iV). For scale- free networks 
having a degree distribution P(k) ~ fc~ A , it was proved 
that I ~ ln(ln(A0) (J ~ ln(iV)) if 2 < A < 3 (A > 3) 
|33j |. i. e., they form an ultra-small- world (small- world) . 
In the frontier value A = 3, the expected dependence is 

' ln(ln(JV)) I** ■ 

In the mean field approximation, the linear Barabasi- 
Albert model (a — 1) has an exponent A = 3 which is 
independent of m. However, in the real linear Barabasi- 
Albert model, simulated in networks with a few mil- 
lions nodes, the exponent A is actually less then 3 [22| . 
Through a careful analysis of our numerical data, we got 
the exponent A = 2.91 ± 0.03. This give us an opportu- 
nity to check two theoretical predictions: if m = 1, the 
linear BA is a graph tree and the average shortest path 
length scales as I ~ ln(iV) yh if m > 2, the expected de- 
pendence is I ~ ln(ln(A)) once actually A < 3. Our 
numerical results corroborate these theoretical previsions 
as can be seen in the Fig. 5. 



VII. CONCLUSIONS 

We studied many properties of the nonlinear Barabasi- 
Albert network. Using the master equation, we derive 
an analytic expression for the degree distribution P(k) 
which holds for all integer m > 1 and real a < 1. In par- 
ticular, the situation a < was focused for the first time. 
This sector had been neglected in most of the previous 
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FIG. 5: The linear BA model (a = 1). Plot of the minimum 
average shortest path length lmin versus ln(iV) and ln(ln(iV)) 
for m = 1 and m = 5, respectively. 



works. We claim that it might be useful in some social 
networks (since the system is highly assortative there) or 
in the study of the crossover from the small-world regime, 
where the shortest path length grows up logarithmically 
with the network size, to the large-world regime, where 
it grows up faster with some power of the network size. 
In the limit a — > — oo, the network is homogeneous with 
connectivity 2m. If a > 1, there are to super-connected 
vertices in the gel phase. The probability that the m 
initial vertices are connected to all the other sites is non 
null only when a > 2 and this threshold a = 2 does not 
depend on to. 

We proposed an analytic formula for the average clus- 
tering coefficient C. Its validity was verified by numerical 
simulations. For any fixed network size, C is a monoton- 
ically increasing function of a. If N — > oo, the clustering 
coefficient falls to zero for all a < 1 and rapidly ap- 
proaches its maximum value C — 1 when a > 1. 

The mixing patterns of the NBA model were deter- 
mined by the assortativity coefficient r. If a < 1, 
the assortativity coefficient r increases with N and con- 
verges asymptotically to some real positive value which 
is smaller than one; if a = 1, r decreases with the size 
N and goes to zero in the limit of an infinite network; if 
a > 1, r diminishes with the size but now converges to 
the value —1. In other words, NBA is assortative (disas- 
sortative) if a < 1 (a > 1) and no assortative only when 
a = 1. In the limit a — > — oo the assortativity coefficients 
can be exactly calculated. We find r = when m = 2. 

Through the minimum average shortest path length 
lmin we estimate the diameter of the NBA. For a fixed 
network size, l m in is a monotonically decreasing function 
of a. It goes to infinite if a < 1 and is equal to 1 when 
a > 1. In the particular (linear) ln(7V) 
if to — 1 and l m i n ~ ln(ln(iV)) when m > 2. 
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